Classifications are perhaps the most natural tool humans use to organize information content. Information items are hierarchical arranged under topic nodes moving from general ones to more specific ones as long as we go deep in the hierarchy. This attitude is well known in Knowledge organization as the principle of organizing from the general to the specific [1], called synthetically the get-specific principle in [2].
Classifications content is usually described using natural language labels, which has been proved to be very effective in manual tasks (e.g. to index documents, to search and navigate the tree). However, natural language labels show their limitations when one tries to automate reasoning over them, for instance for automatic indexing and semantic matching or when dealing with multiple languages.
Therefore, a fundamental preliminary step is to translate classifications into their formal alter-ego, namely into lightweight ontologies. Following the approach described in [2] and exploiting dedicated Natural language processing (NLP) techniques tuned to short phrases (for instance, as described in [3]), each node label can be translated into an unambiguous formal expression, i.e. into a propositional Description Logic (DL) expression. As a result, lightweight ontologies, or formal classifications, are tree-like structures where each node label is a language-independent propositional DL formula codifying the meaning of the node. Taking into account its context (namely the path from the root node), each node formula is subsumed by the formula of the node above. As a consequence, the backbone structure of a lightweight ontology is represented by subsumption relations between nodes.
For example, in case a node labeled “car” is under a node labeled “red” we can say that the meaning of the node “car” is “red car” in this case. This is translated into the logical formula “red AND car”.
[2] provides some examples of lightweight ontologies.[4] and [5] show how lightweight ontologies can be used to automate important tasks, in particular to favor interoperability among different knowledge organization systems.